Parallel Search Through Statistical Semantic Spaces Leveraging Linked Open Data

نویسنده

  • Alexey Cheptsov
چکیده

With billions of triples in the Linked Open Data cloud, which continues to grow exponentially, challenging tasks start to emerge related to the exploitation and reasoning of Web data. A considerable amount of work has been done in the area of using Information Retrieval (IR) methods to address these problems. However, although applied models work on the Web scale, they downgrade the semantics contained in an RDF graph by observing each physical resource as a ’bag of words (URIs/literals)’. Distributional statistic methods can address this problem by capturing the structure of the graph more efficiently. However, these methods are computationally expensive. In this paper, we describe the parallelization algorithm of one such method (Random Indexing) based on the Message-Passing Interface technology. Our evaluation results show super linear improvement

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the Effective Indicators on the Desirable Quality of Open and Semi-Open Spaces of Contemporary Housingry Housing

Today’s housing, as a private realm of human life, has changed in comparison with the past which was made regardless of quality, desirability, and lack of paying attention to the human needs and its psychological consequences; That is to say, residential units have turned into a set of closed spaces and their open and semi-open spaces have been merged into the urban spaces which disrupted their...

متن کامل

Urban Open Spaces Supporting Physical Activity and Promoting Citizen’s Health: A Systematic Review

Background and Objective: Due to the limited individual approach to behavior change, health promotion researchers use community-based initiatives to understand the factors affecting physical activity and promote the health of citizens. Urban open spaces can facilitate participation in physical activity and the health of citizens. The aim of this study is to identify the indicators, attributes, ...

متن کامل

Widget-based Exploration of Linked Statistical Data Spaces

Today, public statistical data plays an increasingly important role both in public policy formation and as a facilitator for informed decision-making in the private sector. In line with the increasing adoption of open data policies, the amount of data published by governments and organizations on the web is growing rapidly. To increase the value of such data, the W3C recommends the RDF Data Cub...

متن کامل

Leveraging Linked Data Analysis for Semantic Recommender Systems

Traditional (Web) link analysis focuses on statistical analysis of links in order to identify “influencial” or “authorative” Web pages like it is done in PageRank, HITS and their variants [10]. Although these techniques are still considered as the backbone of many search engines, the analysis of usage data has gained high importance during recent years [12]. With the arrival of linked data (LD)...

متن کامل

LODDO: Using Linked Open Data Description Overlap to Measure Semantic Relatedness between Named Entities

Measuring semantic relatedness plays an important role in information retrieval and Natural Language Processing. However, little attention has been paid to measuring semantic relatedness between named entities, which is also very significant. As the existing knowledge based approaches have the entity coverage issue and the statistical based approaches have unreliable result to low frequent enti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014